Efficient Attention Pyramid Network for Semantic Segmentation
نویسندگان
چکیده
Semantic segmentation is a task that covers most of the perception needs intelligent vehicles in an unified way. Recent studies witnessed attention mechanisms achieve impressive performance computer vision task. Current based methods differ with each other position and form mechanism, perform differently practice. This paper firstly introduces effectiveness multi-scale context features tasks. We find channel can play vital role constructing effective features. Based on this analysis, proposes efficient pyramid network (EAPNet) for semantic segmentation. Specifically, to handle problem segmenting objects at multiple scales, we design (ECAP) which employ atrous convolution cascade or parallel capture by using rates. Furthermore, propose residual fusion block (RAFB), whose purpose simultaneously focus meaningful low-level feature maps spatial location information. At same time, will explore different modules modules, describe their impact performance. empirically evaluate our EAPNet two datasets, including PASCAL VOC 2012 Cityscapes datasets. Experimental results show without MS COCO pre-training any post-processing, achieved 81.7% mIoU validation set. With deeplabv3+ as benchmark, improve model more than 1.50% mIoU.
منابع مشابه
ESPNet: Efficient Spatial Pyramid of Dilated Convolutions for Semantic Segmentation
We introduce a fast and efficient convolutional neural network, ESPNet, for semantic segmentation of high resolution images under resource constraints. ESPNet is based on a new convolutional module, efficient spatial pyramid (ESP), which is efficient in terms of computation, memory, and power. ESPNet is 22 times faster (on a standard GPU) and 180 times smaller than the stateof-the-art semantic ...
متن کاملLaplacian Pyramid Reconstruction and Refinement for Semantic Segmentation
CNN architectures have terrific recognition performance but rely on spatial pooling which makes it difficult to adapt them to tasks that require dense, pixel-accurate labeling. This paper makes two contributions: (1) We demonstrate that while the apparent spatial resolution of convolutional feature maps is low, the high-dimensional feature representation contains significant sub-pixel localizat...
متن کاملSemantic Segmentation with Reverse Attention
Recent development in fully convolutional neural network enables efficient end-to-end learning of semantic segmentation. Traditionally, the convolutional classifiers are taught to learn the representative semantic features of labeled semantic objects. In this work, we propose a reverse attention network (RAN) architecture that trains the network to capture the opposite concept (i.e., what are n...
متن کاملVolumetric Semantic Segmentation using Pyramid Context Features Supplemental Material
Our “pyramid filtering” insight enables exact and extremely efficient per-voxel classification with minimal memory overhead. We will now demonstrate our improvement over existing techniques empirically and theoretically. For an empirical demonstration of efficiency, consider the following two alternatives to pyramid filtering: 1) the “sliding window” approach: iterate through every voxel in the...
متن کاملStacked Deconvolutional Network for Semantic Segmentation
Recent progress in semantic segmentation has been driven by improving the spatial resolution under Fully Convolutional Networks (FCNs). To address this problem, we propose a Stacked Deconvolutional Network (SDN) for semantic segmentation. In SDN, multiple shallow deconvolutional networks, which are called as SDN units, are stacked one by one to integrate contextual information and guarantee the...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: IEEE Access
سال: 2021
ISSN: ['2169-3536']
DOI: https://doi.org/10.1109/access.2021.3053316